Literature
Eddy SR (2004) What is a hidden Markov model? Nat Biotechnol 22:1315–1316.
https://doi.org/10.1038/nbt1004-1315
Gibson DG, Benders GA, Andrews-Pfannkoch C et al (2008) Complete chemical synthesis, assembly,
and cloning of a Mycoplasma genitalium genome. Science 319(5867):1215–1220.
https://doi.org/10.1126/science.1151721
Güell M, van Noort V, Yus E et al (2009) Transcriptome complexity in a genome-reduced bacterium.
Science 326(5957):1268–1271. https://doi.org/10.1126/science.1176951 (PubMed PMID: 19965477)
Kühner S, van Noort V, Betts MJ et al (2009) Proteome organization in a genome-reduced bacterium.
Science 326(5957):1235–1240. https://doi.org/10.1126/science.1176343 (PubMed PMID: 19965468;
*Here, the genome and proteome of the small bacterium M. pneumoniae are explained in an
exemplary manner)
Lander ES (2011) Initial impact of the sequencing of the human genome. Nature 470(7333):187–197.
https://doi.org/10.1038/nature09792 (*Here, Eric Lander describes what followed in the ten years
after his first human genome sequence)
Lander ES, Linton LM, Birren B et al (2001) Initial sequencing and analysis of the human genome.
Nature 409(6822):860–921. https://doi.org/10.1038/35057062 (*The landmark paper giving the
first description of the human genome)
Liu SJ, Horlbeck MA, Cho SW et al (2017) CRISPRi-based genome-scale identification of functional
long noncoding RNA loci in human cells. Science 355(6320). pii: aah7111.
https://doi.org/10.1126/science.aah7111 (*This recent work describes that there are thousands of
human lncRNAs [over 200 nucleotides long]; 16,401 lncRNA loci were studied in more detail in
seven cell lines. 499 lncRNAs were identified as essential for cell growth, with 89% being cell
type specific. Presumably, there are also thousands of miRNA loci; the ENCODE consortium had
evidence for many miRNAs.)
D’haeseleer P (2006) What are DNA sequence motifs? Nat Biotechnol 24:423–425.
https://doi.org/10.1038/nbt0406-423
Stormo GD, Zhao Y (2010) Determining the specificity of protein–DNA interactions. Nat Rev
Genet 11:751–760. https://doi.org/10.1038/nrg2845
Stormo GD (2013) Modeling the specificity of protein-DNA interactions. Quant Biol 1(2):115–130.
https://doi.org/10.1007/s40484-013-0012-4
Telenti A, Pierce LC, Biggs WH et al (2016) Deep sequencing of 10,000 human genomes. Proc Natl
Acad Sci U S A 113(42):11901–11906. https://www.ncbi.nlm.nih.gov/pubmed/27702888 (PubMed PMID:
27702888; PubMed Central PMCID: PMC5081584; *This paper shows the current state of human
genome sequencing: in the meantime, even 10,000 genomes can be compared on an industrial scale,
for instance for conserved single nucleotide polymorphisms)
The ENCODE Project Consortium (2012) An integrated encyclopedia of DNA elements in the
human genome. Nature 489:57–74. https://doi.org/10.1038/nature11247 (*The ENCODE consortium
has created an encyclopedia of all DNA elements in the human genome that is about 100 times
more accurate than the original initial sequencing. It also showed that about half of the
human genome is actively transcribed, much more than the protein genes alone [30% of the
genome; coding regions only 3%])
Venter JC, Adams MD, Myers EW et al (2001) The sequence of the human genome. Science
291(5507):1304–1351. Erratum in: Science 292(5523):1838 (PubMed PMID: 11181995; *This
is the famous human genome sequencing paper, the work that J. Craig Venter and his small armada
of sequencing robots accomplished in just three years)
3 Genomes: Molecular Maps of Living Organisms